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^ Abstract 

We consider a general polling model with N stations. The stations are served 
_^ exhaustively and in cyclic order. Once a station queue falls empty, the server does 

not immediately switch to the next station. Rather, it waits at the station for the 
L^' possible arrival of new work ("wait-and-see") and, in the case of this happening, 

^ it restarts service in an exhaustive fashion. The total time the server waits idly 

is set to be a fixed, deterministic parameter for each station. Switchover times 
and service times are allowed to follow some general distribution, respectively. In 
some cases, which can be characterised, this strategy yields strictly lower average 
queueing delay than for the exhaustive strategy, which corresponds to setting the 
"wait-and-see credit" equal to zero for all stations. This extends results of Pekoz 
(Probability in the Engineering and Informational Sciences 13 (1999)) and of Boxma 
(^ et al. (Annals of Operations Research 112 (2002)). Furthermore, we give a lower 

OO bound for the delay for all strategies that allow the server to wait at the stations 

/yi^ even though no work is present. 
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1 Introduction and main results 

1.1 Introduction 

In this work, we consider a polling model in the sense of [I3]. In a polling model, one 
server serves several queues, called stations. The classical service procedures are the (a) 



exhaustive, (b) gated, and (c) limited strategies, where the server serves each station (a) 
until no more work is waiting at the respective station, (b) until all the work is served 
that was awaiting the server upon its arrival at the station, or (c) until the server has 
finished at most a predescribed number of jobs. The server then turns its attention to 
the next station. A possible (deterministic or random) idle time between the different 
stations, called switchover time, accounts for things like reloading or refueling. 

Recently, a few papers (most importantly [1] and p[2], also see |3] and (TB]) consider 
strategies where the server does not immediately switch from one station to the next if the 
queue there is empty. Rather, it possibly waits at the station for a while for the potential 
arrival of new messages. This is particularly useful if (i) the switchover times are random 
with sufficiently large variances and (ii) if the server is not likely to find much work at the 
other stations, that is, if the traffic intensity of the current station is much larger than 
those of the other stations. 

The strategy proposed and analysed in this paper is also of this type: Each station 
i is given a fixed wait-and-see credit Tj > 0. Once the server arrives at station i, it will 
work there whenever messages are waiting, but it will also wait (and see) at the station 
for a total time of Tj. Once the credit is used up and no more messages are waiting, it 
will switch to the next station. This strategy was considered by Pekoz [12] for the case of 
a completely symmetric system, where all arrival rates, service times, switchover times, 
and the Tj are identical for all stations. 

The main contributions of this paper are 

• to extend Pekoz' results to the general (that is, not necessarily symmetric) polling 
model and to show in particular that the asymmetry induces some new effects 
previously not observed, 

• to show that our strategy can be adjusted to provide lower delay than with the 
exhaustive strategy in several cases (which can be characterised and which also 
appear for deterministic switchover times), 

• to analyse the case of a polling model with two stations in detail and compare our 
strategy to the one proposed by Boxma et al. |1], and 

• to prove a lower bound for the delay for all strategies that allow the server to wait 
at a station even though no work may be present. 

As mentioned above, introducing a wait-and-see credit is particularly useful if the 
server is not likely to find much traffic at the other stations. This is because changing 
the station means to stay idle for a switchover time rather than resuming work at the 
current station within a short time. Surprisingly, we will see that this effect is largely 
independent of the length of the switchover times. 

So far, the advantage of additional idle times - as we apply them here - was ascribed 
to the random switchover times. The new observation is that using non-zero idle times 
is also particularly useful if the system is asymmetric, that is, one of the stations experi- 
ences much more traffic than the others, even though the switchover times may be even 



deterministic. This is an aspect that could not be observed in p^ , and even though being 
intuitive, we can quantify this effect precisely. 

In our polling model, the stations are served in cyclic order. We mention that the 
performance of all strategies can yet be improved by altering the order in which the 
server serves the different stations. For example, star polling can be applied if one of the 
stations experiences significantly more traffic than all other stations (see e.g. [2| [TT]). 



This paper is structured as follows. In Section 1.2, we describe the model in detail 



and introduce the relevant parameters. Section 1.3 contains a summary of our main 



results. We review related work in Section |1.4[ The proofs for the main results are 
given in Section |2] for the general case and in Section [3] for the refined results for polling 
models with only two stations. In Section |4| we prove a lower bound for the delay for all 
strategies that allow the server to wait at a station even though no work may be present. 
We highlight some possible further improvements and lines of future reseach in Section [5j 
The motivation for this work comes from a real world application: In so-called Ethernet 
Passive Optical Networks (EPONs, see |Hll9]), a service provider is connected to various 
end users via an optical fibre cable. Different optical wavelength channels may be available 
on the cable for the communication, but each wavelength channel can be operated only 
either upstream (messages are sent from end users to the service provider) or downstream 
at a given time. Switching from upstream to downstream operation or vice versa incurs 
an idle time (switchover time). Therefore, each channel of an EPON can be regarded as 
a polling model. 

1.2 The model 

We consider a polling model with N > 1 stations and one server which serves the stations 
in cyclic order. The stations are numbered i = 1, . . . ,N; because of the cyclic order, when 
we talk of the stations, we set A^ + 1 = 1. 

Each station i has its own queue which is fed by a Poisson arrival process whose 
intensity is denoted by A,. Each arriving message has a random length (also called service 
time). The mean and second moment of the message length distribution are denoted by 

(2) 

bi and b] , respectively, and are assumed to be finite. 

The behaviour of the server can be described as follows. The server arrives at station 
i and starts serving (FCFS) all waiting messages and newly arriving messages until the 
queue is empty. This is typically called exhaustive service in the context of polling models. 
However, once the station is empty or if the server finds an empty station upon its arrival, 
the server does not immediately switch to the next station; it rather turns idle for some 
time in order to wait for potentially newly arriving messages ("wait-and-see"). As soon 
as new messages arrive, it starts serving them immediately and in an exhaustive fashion. 
Once finished, it again turns idle and waits for new messages to arrive, and so on. 

The main feature of our model is that the server is set to wait idly for new messages 
for a total time Tj, where Tj > is a fixed parameter of the system, called wait-and-see 
credit. This total time can be spent altogether in one single period, for example, if there 
are no messages waiting at the station upon the server's arrival and no messages arrive 
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Figure 1: Operation of our polling model 



even until time Tj after the server's arrival at the station; or it can be spent in different 
periods - interleaved by different busy periods. Note that, since Tj is fixed, the server 
may not use any information about the current queue status at other stations nor about 
the future of the arrival process at any station. 

After the server has spent a total waiting time of Tj at station i, it starts the switchover 
to station i + 1. Hereby, it first spends a possibly random idle time, called switchover time, 
where it does not serve any messages neither at station i nor at station i + 1. The random 
switchover time from station i to station z + 1 is assumed to have finite mean rj > 

(2) 

and finite second moment r) . We will consider both non-deterministic and deterministic 
switchover times (in the latter case r^ = rf). 

The message generation process, the lengths of the messages, and the switchover times 
are assumed to be independent - both among each other and with respect to the other 
processes and stations. An illustration of the operation of the server is given in Figure [Tj 

The goal of this paper is to derive an explicit formula for the mean average delay of a 
job for this model in steady state, that is, the expected time a message experiences from 
the point in time when it arrives in one of the queues until its service starts (i.e. excluding 
the processing time). The expected delay of a message generated at station i is denoted 
by EDj, the mean average queueing delay is then defined by 
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where pi := Ajfej is the traffic load offered to station i and po := X]j=iP« i^ the total load 
offered to the system. We stress that the delays of the different stations are weighted 
by the traffic intensity pi, which implicitly includes weighting with the average message 
lengths, whereas the delays EDj do not include weighting the delay of the individual 
messages with their lengths. This seems to be common in the literature; e.g. Takagi [T3] 
calls this quantity intensity weighted mean waiting time (p. 92, [T3]). 

r 

hf\ Ti, ri, and rf ^ i = l,...,N. 

Furthermore, it will be convenient to use the following abbreviations. We abbrevi- 
ate by ro := J2i=i ^i ^^^ ^^^ of the mean switchover times and by Tq '^^ 



The mean delay will be expressed explicitly in terms of the above parameters Aj, hi 



rLrf' + 



Y2i j^i j^,- TiTj the second moment of the sum of all switchover times. Finally, we let 
To := ^j=i Ti denote the total "wait-and-see" time per cycle. 



1.3 Main results 

In this section, we give our main results. Theorem fTl gives a formula for the mean average 

— (2) (2) 

delay in terms of the parameters of the system Aj, 6j, b] , rj, and r^ , i = 1, . . . , iV, as 
well as the times Ti, i = 1, . . . ,N. This is simplified for the case of two stations, N = 2, 
in Corollary [2j 

The formula for the delay allows to investigate the following question: Given the 
system parameters Aj, 6j, bi , Tj, and ry , - how does one have to adjust the parameters 
Ti, i = 1, . . . ,N, such that the mean average delay is minimized. We will see that in 
many cases it is favourable - in the sense of lower average queueing delay - to choose 
positive Ti. This is either due to (i) the random switchover times or (ii) the asymmetry of 
the system. This is described in detail for the case of two stations in Theorems [3] and |4} 
where the effects (i) and (ii) are treated somehow in a decoupled way. 

Finally, in Theorem [9] we consider all strategies that allow the server to wait at a 
station even though no work may be present. We give a lower bound for the delay for 
any such strategy. 

Before we come to the main results, let us mention the stability condition for the 
system. Due to the exhaustive nature of our polling model it is clear that the system is 
stable if and only if 

Po < 1- 

We assume this condition from now on. 

Furthermore, we recall the result for the so called exhaustive strategy from [13j. In 
our model, this corresponds to the special case Ti = T2 = . . . = T/v = 0. In this case, one 
can find that 
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2(1 -Po) 2po(l-po) 2ro' 

In terms of queueing delay, it was shown in [10] that the exhaustive strategy provides the 
lowest delay in the class of all non-idle strategies, in particular, as compared to the gated 
and limited strategy. Therefore, it will serve as a benchmark for our strategy, which is a 
strategy allowing the server to be idle even though work may be present in the system 
(at other stations). 

The main theorem for our polling model is as follows. 

Theorem 1. The mean average delay of the polling model introduced above is given by: 

^ ^ Ef=iA.&r ^ (ro + To)(pg-Ef=iP.^) , W^^+roEt,T,iPo-P^) 
2(1 - Po) 2po(l - Po) Po('^o + To) 

(ro + io)Po[^ 2(1 -p,) i<^<;v 



The proof of this theorem is given in Section [2] Certainly, one can ask which values of 
Ti, . . . , T/v lead to a minimal queueing delay. In other words, given the system parameters, 
we would like to know how we have to set Ti, . . . , T/v in order to minimize D. Note that this 
is a non-trivial question, because the Tj appear in numerator and denominator. In fact, 
this is a minimization problem in the variables Ti, . . . ,Tjv, subject to the non-negativity 
restriction Tj > 0, for alH = 1, . . . , iV, which can be carried out - in principle - explicitly. 
We discuss the respective minimizers below for N = 2. Certainly, for large A^, one would 
solve the problem numerically. 

For two stations N = 2, the main result reduces to the following simpler formula. 

Corollary 2. The mean average delay of the polling model introduced above with N = 2 
is given by: 
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In particular, one can minimize (Q w.r.t. Ti and T2 subject to the restrictions Ti > 0, 
T2 > in order to obtain the minimal possible delay. Let us denote by T* and T2* the 
minimizers. We say that there is no gain from waiting at station i iiT* =0; if T* > we 
say that it is worth waiting at station i. 

From the explicit expression above, one can observe the following consequences. First, 
we consider a partially symmetric polling model, by which we only mean that both sta- 
tions have the same intensities pi = p2, but not necessarily the same switchover time 
distribution, message length distribution, nor arrival rate (cf. [12]). 

Theorem 3. Consider a polling model as introduced above with two stations. In the case 
of a symmetric polling model, pi = P2, the following holds. 



With deterministic switchover times, that is, rf 



,(2) 



and To 



r2 ; we get T* 



T2 =0. I.e. in this case there is no gain from waiting at either station. 

With non- deterministic switchover times, that is, rf < r[ or r2 < r2 , it is worth 
waiting (at both stations) if and only if 



2pi<l 
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In this case, the optimal waiting time T* 



l-2pi 

T2 > can be calculated explicitly. 



see (26). The minimal delay is then given by inserting (26) into (tw; this delay is 
strictly lower than the mean average delay induced by the exhaustive strategy. 



We remark that the fraction on the right-hand side of ^ equals 

{E[R^ + R2]f 

var[i?i + R2] + (E[i?i + i?2])^ yE& ' 



(3) 



where Ri are independent switchover times for switching from station i to station i + 1, 
respectively. 

Now we consider an asymmetric polling model, i.e. pi > p2- 

Theorem 4. Consider a polling model as introduced above with two stations. In the case 
of an asymmetric polling model with deterministic switchover times, that is, assuming 
Pi > P2 oind rf = r[ , r2 = r2 , the following holds. 

• There is no gain from waiting at station 2, i.e. in all cases Tg* = 0- 

• Further, it is worth waiting at station 1 if and only if 

Pi-pl + pl-p2- 2piP2 > 0. (4) 



In this case, one can calculate the minimizer T^ > explicitly as in (29). The 



minimal delay is then given by inserting (29) and Tg = into M); this delay is 



strictly lower than the mean average delay induced by the exhaustive strategy. 

Remark 5. A similar discussion is possible for the case of an asymmetric polling model 
with non-deterministic switchover times. There, both of the following effects will be 
combined. Namely, note that Theorem [3] shows that large variances of the switchover 
times (increasing the variances of the switchover times in condition (pi), cf. (|3|) lead to 
the situation where it is worth waiting (at both stations). On the other hand. Theorem |4] 
shows that a strong asymmetry (reducing p2 in condition (|4])) makes it useful to wait at 
the station with significantly higher traffic intensity. These effects will both be present in 
the case of a not necessarily symmetric system with non-deterministic switchover times. 

Remark 6. We remark the following rather surprising fact: Note that ^ does not depend 
on the switchover times. So, the question whether it makes sense to wait at station 1 
only depends on the relation of the intensities pi and not on the length of the possible 
idle period due to the switching. Similarly, the expression in ^ does not depend on 
the absolute lengths of the two switchover times (one can multiply both Ri by the same 
constant without changing ([2]), cf. ([3])) nor on the order of the switchover times (but only 
on the sum). However, even though the decision whether to wait or not does not depend 



on the absolute length of the switchover time, the resulting credit does (cf. (26)). 
Remark 7. Similar discussions are possible for N > 2 since D has the form 

TAT^ + fb^ + a 

D = c^ . 

ro + To 

with some A^ x A^- matrix A, constants c, a, b, and T := (Ti, . . . , T^). 



Remark 8. The trivial case A^ = 1 is included in our results. It corresponds to a single 
queue where the server takes vacations. In our model, funnily enough, it takes a vacation 
(switchover time) after it has spent a total idle time of Ti at the queue. The corresponding 
delay is 



2(1 -pi) 2(ri+Ti)' 

which is obviously minimized for Ti = oo, which also is easy to interpret, because then the 
server is never allowed to take a vacation; and the system is thus identical to an M/G/1 
queue. 

Finally, we discuss a lower bound for the delay for strategies that allow waiting times 
of any type. We recall at this point that [10] shows that the exhaustive strategy provides 
the lowest delay in the class of all non-idle strategies, that is, all strategies where the 
server is not allowed to wait at a station if no work is present there. 

In the following, we consider strategies that are not allowed to use future information 
of the system, that serve FCFS, and where the server is not idle if at its present station 
messages are waiting to be served. Further, we have to assume that with this strategy 
the system has a steady-state distribution. 

The next theorem gives a lower bound for the delay for all of these strategies where 
the server is allowed to wait at stations due to reasons that depend only on the current 
station in the current cycle (that is, since the server arrived at the present station). This 
restriction considers those strategies that look at the evolution of the traffic at the present 
station since the server arrived there. It does not allow strategies that take their decisions 
according to e.g. the queue status at different stations or the recent switchover times. 

This provides a lower bound, in particular for the model treated so far, the strategy 
proposed in [4j, as well as the strategy proposed in Section [s] below. 

Theorem 9. Consider a polling model where the stations are served in cyclic order. Then, 
for any strategy that allows the server to wait at a station even though no work is present 
there but the decision on whether and how long to wait only depends on the evolution of 
queue of the current station since the server arrived at the station, we have 
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where aj := pjj::^ + fj and /o = Y.i=i fi- 



The idea behind this theorem is that fi is the expected time the server spends waiting 
at station z in a cycle. The time as such is random for a general strategy, of course. Since 

8 



the fi are unknown in general, the minimum appears. In the case of the concrete model 
treated so far, we had /j = Tj, because by definition the total time the server spends at 
station i is deterministic and equals Tj. 

The minimum in ^ can be calculated explicitly in principle as well as numerically 
without any problem. A proof of Theorem [9] is given in Section |4j 

1.4 Related work 

Basic references on polling models are [I3], [H], [15], [TO] . 

References that refer to polling models where the server may be waiting at a station 
are apparently rare. The main references for us are Pekoz [12] and Boxma et al. [4j. 

Pekoz introduced the strategy we use in this paper for the completely symmetric model 
(that is, all the arrival rates, service times, switchover times, and the Tj are identical). 
In particular, his Theorem 2.2 is a special case of our Theorem [Tj In the present paper, 
we consider the general polling model. Furthermore, a new observation is that also a 
sufficiently asymmetric system can make it useful to wait at a station, independently on 
whether the switchover times are random or not. 

The second main reference is Boxma et al. [1], where a polling model with N = 2 
stations is analysed. In that work, the following situation is investigated: If the server 
encounters an empty queue at station 1 once it arrives there, a "wait-and-see" timer is 
activated in order to wait for the possible arrival of new messages. However, - contrary to 
the present setup - once the server has finished some work or the timer has run out, it will 
immediately switch to the next station. We compare the resulting delay obtained from 
this strategy to ours in Figure 2. We have found cases, where our strategy leads to lower 
delay than the strategy proposed by Boxma et al. and also cases where it performs worse. 
The latter is usually the case if the intensities pi,p2 are close to each other, whereas in 
the case that we deal with a highly asymmetric system, our strategy seems to be better. 
Also, for large switchover times our strategy seems to perform better than [1] , since in this 
case the timer from [1] is rarely activated. Unfortunately, it does not seem to be possible 
to compare the strategies directly due to the non-explicit nature of the delay formulas in 

Further references on polling models where the server may be waiting consider only 
single-station systems with vacations (|i3j and [16j). 

The strategy employed in [1] and in the present paper is somehow related to a so-called 
forced idle time. We refer e.g. to [5l [6] for some work on this. However, in the present 
setup, the server is not forced to be idle; whenever it is set to "wait-and-see", it rather 
resumes service as soon as new messages arrive. This is the reason we prefer the term 
"wait-and-see" rather than "forced idle time". 
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Figure 2: 

Comparison to Boxma et al. j3j, 
delay vs. credit (resp. timer) at stat. 1. 
Note that in the first plot the strategy 
from |1] provides lower delay; in the 
second plot only the arrival rate of the 
second station is changed and then our 
wait-and-see strategy has lower delay; 
in the third plot the same happens 
due to longer switchover times 



2 The fundamental relations for the general polling 
model 

In this section, we derive the fundamental relations for the general polling model that 
allow us to obtain the formula for the mean average queueing delay. In particular, we 
give a proof of Theorem [TJ We proceed in several steps. 

Cycle time. The cycle time is defined to be the time that the server takes from one 
arrival at station 1 to its next arrival at the same station. We obtain the average cycle 
time, EC. 

First, let us define more precisely the notion of the server being idle, switching, and 
waiting. The server is waiting when it is at some station waiting for messages to arrive. 
Note that, by the definition of our strategy, the total time the server spends waiting in 
each cycle equals Yli=i ^i- '^^^ server is said to be in the state of switching from the time 
it leaves one station until it starts working at the next station. Finally, we say that the 
server is idle if it is either waiting or switching. 

Now, note that a cycle contains periods when the server works and periods when it 
is idle. In our polling model, the server is idle exactly for the time it waits and for the 
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switchover time. Thus, the expected time the server is idle in a cycle equals 

TV N 

{l-po)EC = Y,r^ + Y,'^^■ (6) 

This allows to deduce the expected cycle time in our polling model in steady state: 

EC = '^^^±^. (7) 

1 -Po 

The decomposition principle. We will use a decomposition principle to deduce our 
results. One can find a proof of this principle in other contexts e.g. in [H H]. We omit the 
proof for our system since it is completely analogous. 

In order to formulate the decomposition principle, we need the notion of the workload 
V, which we define to be the sum of all message lengths that are present in the system 
(including the remaining length of the currently processed message) at a random point in 
time in steady state. 

Then the decomposition principle says that V has the same distribution as 

Y £ yM/G/l j^ yidle 

where \/^/^/i is the workload in the same polling model with no switching or waiting 
times, that is, an M/G/1 queue. On the other hand, V^'^^'^ is the workload at a ran- 
dom point in time given that the server is idle at that point, and V^^'^^^ and V^'^^'^ are 
independent. 
Let 

P (server switching) 
g : = P (server switching | server idle) 



P (server switching) + P (server waiting) 

Therefore, 

EV = E1/^/^/^ + gEr"™^*^^'''^ + (1 - g)E1/™^'*^''^ (8) 

where v^s^itchmg ^^^ ^waiting gj.Q ^Yie workloads at a random point in time given that the 
server is switching and waiting, respectively, at that point. 

Expected workload. We now calculate KV in two different ways. On the one hand, 
note that 

N N ^(2) 

E,V = >^6jE[# messages in queue at station i] + /^Pt-h-- (9) 

1=1 1=1 

Indeed, the first term accounts for the fact that there are messages that are not yet in 
service and that are waiting at the different stations. The second term corresponds to the 
fact that with probability pi we are looking at station i and a message is being processed 
there. The workload of that message is exactly its expected residual lifetime, that is, 

&fV(2&.). 
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By Little's law, the last equation becomes 

N N ,(2) N N ,(2) N ,(2) 

i=l i=l i=l i=l i=l 

This equation shows that, in order to obtain the mean average queueing delay of our 
system, D, we have to calculate the expected workload EV^. 

On the other hand, we use the decomposition principle n8«. Clearly, EV^^*^^^ is known: 



El/M/G/ 



2(1 - Po) ' 



see e.g. [7], p. 201. Therefore, we obtain with rtsl) and (10) that 



2(1 — Po) ~f 20j 



i=l 



We are left with calculating the expected workload given that we find the system in 
the state of switching and, respectively, the expected workload given that we find the 
system in the state of waiting, as well as some relation between time periods of switching 
and waiting. As concerns the latter, in fact, it is sufficient that we clearly know that 



q 1 1 



P (server switching) P (server switching) + P (server waiting) 1 — Po 

and 

1-g 1 1 



(server waiting) P (server switching) + P (server waiting) 1 — Po 



:i2) 



(13) 



Workload present while switching. Observe that 

1 ^ 

^yswitching _ ^ ^ y" piEvr'''^'''^ (14) 

P (server switching) -^ 

where Pi is the probability of encountering the server in the state of switching from station 
i to station i + 1 when entering the system at a random point in time and EV^*^™' '^ ™^ is 
the expected workload at such a point in time. 

Clearly, pi = Vi/KC, since this is the fraction of time in a cycle that the server spends 
switching from station i to station i + 1. 

Now, we have to find Ey^^™'*'^ "^, the expected total amount of work that is present 
given that we look at the system at a point when the server is switching from station i 
to station i + 1. Let us assume we are at such a point in time; then there are different 
times when the currently present workload was generated. We distingish these times and 
determine the respective workload: 
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At all stations workload was generated during the current switching period. It is 
given by the expected residual lifetime (in fact, the expected backwards recurrence 



.(2) 



time) of the current switching period: ^ ■ J2j=iPj- 

At all stations, except for the i-th station, workload was generated while the server 
was at station i (working and waiting). The time spent working has mean pjEC, and 
the time spent waiting equal Ti, so that the generated workload becomes {piEC + 



• Similarly, while the server was at station k (working and waiting), k ^ i, at all 
stations (except for those that later in the cycle became emptied) workload was 
generated and is still present. The time spent working has mean pk^C, the time 
spent waiting equals T^, so that the generated workload becomes 

(pkEC + Tk)- Yl P^^ (^^) 

ie{i+i,.--,fc-i} 

where {i + 1, . . . , A; - 1} is defined as {i + 1, . . . , A^} U {1, . . . , A; - 1} if ? + 1 > k-l. 

• During the switchover time from station k to station k + 1 {k ^ i), which takes on 
average r^, workload was generated at all stations (except for those that later in the 
cycle became emptied): 

Tk- Yl Pr (16) 

je{i+l,...,k} 

Summing up all this workload, we get 

j<i \i=i+i 1=1 / j>i i=i+i 

/ N j~l \ j-1 

+Y.p^^^ [Y.p^ + Y.pi] +Y.p^^^ E pi 

j<i \l=i+l 1=1 / j>i l=i-\-l 

/ N j-1 \ j-1 

+Y.^AY.p^ + Y.p^]+Y.^^J2p^ 

j<i \/=j+l 1=1 / j>i l=i+l 

(2) 

+PiEC{po-p,) + {po-p^)T, + po'^. (17) 

Workload present while waiting. Analogously to the workload while switching, we 

observe that 

1 ^ 

^p^waiting _ ^ YqiEVr''"^ (18) 

P (server waiting) -^ 

where qi is the probability of finding the server waiting (in a wait-and-see state) for 
messages at station i and KV^^^ '°^ is the workload one would find at such a point in time. 
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Clearly, qi = Ti/KC, since this is the fraction of time in a cycle that the server spends 
waiting at station i, by the definition of our polhng model. 

Similarly to the workload while switching, we obtain the expected workload generated 
while station i is in the state of waiting. Let us assume we are at such a point in time, 
then there are different times when the currently present workload was generated. We 
distinguish these times and determine the respective workload: 

• When the server started working at station i, there was work waiting there. We 
denote the length of the "busy period" generated by this waiting traffic by Zi. The 
workload generated at the other stations during this busy period is 

EZ,-{po-pi). (19) 

In order to determine EZj note that the average time the server spends working at 
station i is, on the one hand, piEC. On the other hand, the time the server spends 
working at station i consists of the length of the first busy period, EZj, and all other 
busy periods generated, which is the number of busy periods in an M/G/1 queue 
with total idle time Tj. However, the expected number of busy periods in an M/G/1 
queue with total idle time Tj is AjTj (just disregard the time of the busy periods, 
then because of the memoryless property, the number of busy periods is Poisson 
with intensity AjTj). 

Thus, 

PiEC = EZ, + XiTi -^, 
1 -Pi 

since j^— is the average length of one busy period at station i. This equation allows 
to calculate EZj. 

• Workload was generated at all other stations except for the i-th, during the busy 
periods that have already taken place at station i, not considering the first busy 
period when the server started to work at station i. Per busy period, a workload at 
the other stations of in total 

(Po - pi) 



1-Pi 



6, 



was generated, since j^— is the average length of one busy period at station i. In 
order to obtain the number of busy periods that have already taken place, note that 
these are on average AjTj/2, because the waiting time is deterministic. Thus, we 
obtain 

^ T^ ■ (Po - P.) (20) 

Z L Pi 

for the total workload that was generated at all other stations during the busy 
periods (except for the very first one) at station i. 
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The total workload that was generated at all other stations during the waiting time 
spent so far at station i is with the same reasoning 



T,; 



■ (po - Pi 



(21) 



The term in (15) has to be considered in the same way. 



The term in (16) has to be considered in the same way. 



Summing up all this workload, we get 

N 



EV 



waiting 



j<i \i=i+i 1=1 / j>i i=i+i 

j<i \l=i+l 1=1 / j>i l=i+l 

/ N i-1 \ J-1 

+T.^AT.p^ + Y.p^]+Y.^^Y.p^ 

j<i \l=i+l 1=1 / j>i l=i+l 

+(po-p.)(p.EC+|(l-^ 



(22) 



Proof of Theorem jT| In order to see the formula in Theorem[T| one just has to combine 
(|ig, (0, and (g, ([Ml), ^, and (|l|), (|l8|, (g. D 



3 The case of two stations 

In this section, we prove Theorems |3] and |4j First we prove that the optimal parameters 
in the two-station case satisfy a linear relation. Then we prove Theorem M (symmetric 
case, deterministic and non-deterministic) and Theoremll] (asymmetric and deterministic), 
respectively. 
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For simplicity, we introduce the following abbreviations: 



Cl 
C? 



2(1 -Po) 

2 (2) 

PiP2rg Por^ 

1-po 2 

2p2Piro 



1 -Po 
2piP2ro 

1 -Po 



'"0P2 

^opi + 

2p2Pi 

1-Po 

^ P2 A _ Pi 

2 2 V 1-Pi 

^ Pi A _ P2 

2 2 V I-P2 



An easy calculation shows that these are non-negative constants. With these abbre- 
viations, formula ([I]) becomes: 



D = ci + 



C2 + C^Ti + C4T2 + C5T1T2 + CqTI + CyTl 



(23) 



Lemma 10. On the set tq + Tq > 0, the minimizers of the quantity in (23) satisfy the 
following linear relation: 



(C5 - 2cq)T^ =C3-C4 + (c5 - 2C7)T2*. 

In particular, in the symmetric polling model (pi = P2), we must have 



(24) 
(25) 



We remark that the above minimizers can be negative. Recall that we are interested 



in the optimal parameter, which are the minimizers of (23) subject to the restriction 



Tl^T^ > 0. This is why we distinguish in the following between the minimizers of (23) 
and the optimal parameters. 



Proof of Lemma 10; Clearly, (23) shows that D can be written as follows: 

/(Ti,T2) 



D = D{T,,T2)=c,+ 



Po{ro + Ti + T2 



with some function f.UD has a minimum at T* and T2 (with tq + T^ + T2* > 0) it must 
satisfy 

dD dD 

-{T*,T;) = and 7^(Ti*, T^) = 0. 



dTi 



dTo 
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Due to the fact that the denominator is a hnear function in Ti + T2, an easy calculation 
shows that we must actually have 



This is 



C3 + C5T2 + 2c6T* = C4 + C5T* + 2C7T2*, 



exactly as asserted in (24) 



In the symmetric case we have pi = p2 < 1/2 and C3 = C4 and C7 = cq which implies 

D 



that (24) becomes (25). 



Symmetric polling model. We now consider a symmetric polling model, i.e. pi = 

Proof of Theorem J3| Assume that T^ > and Tg > are the optimal parameters. 
Then we know from (25) that T^ = Tg =: T. Therefore, we obtain: 



D = ci 



(r(^) + Pii) + 2(ro + S)T + (t^ + 1 - t^)T 



2(ro + 2T) 
The minimum of this expression is attained at 



1 1 

- - rn H — 
2^2 



T* = -; ro + ^ xlArlp - Srg + ( rf + r^ ^ 



l-2p 



(4 - 12p + 8p2) 



(26) 



Let a := Tq + r^jA^-- The condition for T* to be well-defined and positive is: 

4r2p-3r2 + a-4(l-p)(l-2p)>r2. 

This is true if and only if 

a(l - 2p) > rl 

which is easily seen to be equivalent to what we stated in (pi). 

-2p 



In the deterministic case, a = Tq + '"oi^"' ^^^ condition becomes 



2p< 1- 



1 



r^ + r^^^ 



p ' 



'0 "T '0l_2p ^ + l-2p 

which can easily be seen to lead to the contradiction p < 0. 



D 
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Unsymmetric polling model with deterministic switchover times. We now con- 
sider an a asymmetric polling model, i.e. pi > p2 with deterministic switchover times, i.e. 
r[ = r^ and r| = rf. 

Proof of Theorem [4], first part: Recall that we would like to show that there is 
no gain from waiting at the station with less traffic, that is, station 2 in our case. We 
distinguish two cases: pi > 1/2 and pi < 1/2. 
First case: pi > p2 and pi > 1/2. 



Note that trivially p2 < 1/2. Recall that the linear relation (24) holds for the mini 



mizers of ( 23 ) (which is the same as 



C5-2C6 



In). Since pi > 1/2 and p2 < 1/2 we get 



<0 

and ':^~^':'^ < 0. Therefore, due to (24) one of the minimizers T* or T2* must be negative. 



C3-C4 

C5-2C6 







Therefore, the minimizers subject to the restriction T^,T2 > must satisfy either Tg 

or T* = 0. However, the second case can be excluded easily: If we set Ti = in (23) and 

optimize in T2 we would get: 



This can be seen to be negative, because 



2 C2 — C4ro 



C7 







C2 - c^ro < 






follows from 












-rlpi - 


PiP2ro 1 (2) 2I/ 
1 "+9P0^0 =^L(P2- 
i - Po / / 


-Pl)- 


PlP2'^0 

1-po 



<o, 



which holds since P2 < Pi- 

Therefore, the case Tj* = 0, T2 > can be excluded; and we must have Tj* > and 



T2* = for the minimizers of (23) subject to T^,T^ > 



Second case: pi > p2 and pi < 1/2. 

First let us rewrite the delay formula ([T|. We exclude the trivial case ro = and set 
5*1 := Ti/tq and 6*2 := T2/rQ. Then ([I| becomes: 



n , '^o / P1P2 ., , c- , c\ , bo + '^iP2 + '^2Pi + 5?f T^ + 5|^T 
D = ci^ (1 + 61 + 62) H 



2pi l-2p2 
P2 



Po \ 1 - Po 



1 + 5i + 52 



(27) 



With the notation 5*1 and 5*2, the linear relation (|24|) becomes 

C3 - C4 1 , C5 - 2C7 



C5 - 2c6 ro C5 - 2c^ 



^ _ pi- p2 , PiT^^^ 

•^2 — 1„9n, H 1_9n, '-'2- 



P2 i_p^ P2 i^p^ 



Setting c := ^\lg^ and 6 : 



^2T3^ P2-i_^^ 



l-2p2 

' 1 -~2?i , this is 



Si = c + bS2- 



(2J 
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Consider D = D{Si,S2) (given in (27)) as a function of 5*1 and 5*2. It suffices to 



consider the function D{Si,S2) only at those points that satisfy the hnear relation (28), 
that is, D{c + 65*2, 5*2), 5*2 G [0, oo). We are finished if we can show that the derivative of 
this function w.r.t. 5*2 at is non-negative, since then the optimum must be attained for 
negative 5*2, and thus negative T2, which is impossible. Let g{S2) := D{c + bS2, S2). 
The function g can be written as follows: 



^(^2) 


= C,+ ''IP'P' {l + C + bS2 + S2) 

Po V 1 - Po 




ip0 + (c + 652)p2 + 52Pl + (c + 652)2f^ + 5|^i^\ 




' l + C + bS2 + S2 J 


Clearly, 




^'(0) = 


Po V 1 - Po 




(l + c)(p26 + Pl + Cp2^)-(l + fe)(ipO + P2C + c2f^) 



(1 + C)2 

We would like to show that g'{0) > 0, which is true if and only if 

piP2{l + b){l + cf l-2pi Po ,c2p2l-2pi 

z + P2& + P1 + CP2- 6 1 + c - — +P2C+— 1 + fe > 

1 - Po 1 - Pi 2 2 1 - pi 

After some calculations, it can be seen that this is equivalent to 

-2pip2 - Pi + P? + P2 - P2 + 2piP2 + 2piP2 < 

which is easily seen to be always satisfied in the case pi, p2 < 1/2. □ 

We have seen that in the symmetric polling model with deterministic switchover times 
there is no gain from waiting at the station with less traffic (station 2). Now, we determine 
when it is useful to wait at the station with more traffic (station 1), and what is the 
optimal waiting time Tj* in this case. It turns out that the condition is (|4]); and the 
optimal parameter is given by: 



Tr = -ro + ^/rg + ^^^^; (29) 



and the corresponding delay is then obtained by plugging in (29) and Tg = into (23). 



Proof of Theorem [4], second part: We get the optimal parameter if we set T2 = in 



(23) and differentiate w.r.t. Ti. Then the minimizer is seen to be given by (29). Condition 
(iTcorresponds to T^ > 0: in order for (29) to be positive, we must have 



C2 - csro > 0, 

which translates into 

Po ^ , P2P1 

^ >P2 + z. , 

2 1-po 

and thus ^ appears. D 
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4 Lower bound 

The goal of this section is to give a proof of Theorem |9] For this purpose, let us define 
the following random variables. We denote by Fi the time in steady state that the server 
spends at station i waiting in a cycle, that is, being idle because there is no work at that 
station. Furthermore, let fi := EFj, fl := E,F^, and /o := X]i=i /«• 

For a general strategy basically nothing can be said about the distribution of the F^ 
even about their means fi. The idea behind the proof of Theorem [9] is to estimate the 
mean average delay by an expression in terms of fi and /^ , to estimate by Jensen's 
inequality, 

fl"^ > f!, (30) 

and thus to obtain an expression that only depends on the /«. Then, minimizing over 
/i > 0, we obtain the lower bound ([s]). The details are as follows. 

Decomposition. Let V be the workload as defined above. Analogously to the decom- 
position principle in ([s]) one can show that 

We note that in the derivation of ^ and (10), the strategy was not used at all. So, one 
obtains a lower bound for D from a lower bound of KV, and thus from a lower bound for 

EV''^^" = gE'l/""'*'^'''s + (1 - g)E'l/"'''*'°^ (31) 

where q = P (server switching | server idle). 

Furthermore, we note that the cycle time satisfies 

(1 - po)EC = ro + /o. 

Workload while switching. Now we express ]E\/™itching ^^ terms of the (unknown) 
fi. This is completely analogous to the derivation for our concrete strategy with the 
arguments following (|I4J) replacing Tj by fi. The result is 



TgT/switching _ ^o(Po A^i=l Pi) . \ \ n ^^ 

2P (switching) EC • P (switching) ^ ^°" 



+ E 



Ti 



i-l / A^ i-1 \ N j-1 ■ 

(po - Pi)fi + ^fA ^ Pi + ^Pi]+ Y^ fi^ Pi 

i=l \l=i+l 1=1 J j=i+l l=i+l . 



m) 



Workload while waiting. Since we do not know the distribution of the waiting time, 
we cannot say much about the workload generated while the server is waiting. So, we will 
have to estimate at this point. First note that 

1 ^ 

P (waiting) ^ 
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where KV™ ™^ is the expected workload that is present in the system at a point in time 
when the server is waiting at station i and Pi = fi/KC. 

We cannot calculate the workload present at a point in time when we encounter 
the server waiting, EV^^^' ™^, for an arbitrary strategy, but we can give a lower bound. 
Namely, we can say that KV^^^ ™^ must be at least, on the one hand, the traffic that 
was accumulated at the other stations during the time that the server has already passed 
waiting at station i (that is, the expected backwards recurrence time). Additionally, since 
the decision of the strategy does not depend on the recent times the server has worked 



at the other stations nor the switchover times, we can also count the terms (15) and 



(16), that is the traffic that was accumulated (and is still present) while the server was 



switching and working at other stations, respectively. This gives 



EC^^"°' > 1^ ■ (po - P.) + E ( (/^'^^c' + /.) ■ Y. p^+^'^- E p^ 

•'^ kj^i \ je{i+i,...,k-i} je{i+i,...,k} 



This is the crucial observation in the derivation of the lower bound. 



Now, by Jensen's inequality (30), the last term can be yet bounded below by 



fi 

2/. 


(Po 


-p.)+E 


{pkEC + h) ■ 




k^i 


\ 


i6{j 


Furthermore, 


we 


need that 

q 




1-g 




P (switching) 


P (waiting) 



Y^ Pj + rk- Y P^ 

je{i+i,...,fc-i} je{i+i,...,fc} 



P (idle) 1 - Po 



(34) 



Then, putting (34) back into (33), and this and (32) back into (31) gives a lower bound 



for ¥y (and thus for D) only in terms of the /j. Minimizing over the /, leads to (O. 



Note that we cannot count the terms (19) and (20), since e.g. Zi and Fi are not 
independent. 



5 Different strategies and outlook 

There is another strategy, which we would like to propose here (we will refer to it as 
"Strategy 11" ), which is likely to be better than the one proposed so far (called "Strategy I" 
in this section) in terms of mean average delay. However, we are not able to analyse the 
mean average delay of Strategy II with the present methods. 

Strategy II is defined as follows. We consider a polling model as above, the only 
difference being that the credit Tj now refers to the total time the server spends at station 
i. More precisely, a server arrives at station i. It then works or waits at station i depending 
whether messages are present or not. At time Tj after its arrival at the station, it only 
finishes all the work that is present at that time (exhaustively). It does not turn idle 
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server switches from 

station i — 1 to 

station i 



server working 
(serving waiting/ 
arriving messages) 



server is waiting 
at station i 



server 
working 



server 
switches 
from St. i 

to St. i + 1 



server leaves 
station i — 1 



server arrives 

at st. i 



P 



queue at st. 
i is empty 



new message 

arrives in 
the queue of 

st. i 



time 



a + ^ + 7 = 
Ti credit is 
reached 



queue is empty 
and thus server 
leaves station i 



Figure 3: Operation of the polling model with Strategy II 

again at that station in the current cycle (i.e. into "wait-and-see"); whenever the station 
is empty, it starts switching to the next station. We refer to Figure [3] for an illustration. 

It is likely that Strategy II adjusted to its optimal waiting parameters gives a lower 
average delay than Strategy I adjusted to its optimal waiting parameters T^, . . . ,T^. 
Heuristically, Strategy II uses more information about the system, because it also counts 
the busy periods at the current station. However, we remark that even the determination 
of the cycle time, as in (|6]), does not seem to be possible in a straightforward way. 

We conjecture that for N = 2 and deterministic switchover times Strategy II provides 
the lowest mean average delay in the class of all strategies that are not allowed to use 
information of the queue status at the other station nor to look into the future of the 
system. 

Figure |4] gives a comparison of Strategy I and II, where the curve for Strategy II 
is obtained from simulations. We observed that the optimal credit for Strategy II is 
approximated by Tj* + piKC, where T* is the optimal parameter of Strategy I and EC is 
the cycle time of Strategy I for this optimal parameter. 

Certainly, one can define different strategies, where e.g. the server additionally has 
more information on the current queue status at the other stations. This may give an 
average delay that is even below the lower bound given in Theorem |9j However, note 
that even if the server is aware of the queue status at all stations, it is not completely 
clear what is the best decision at each moment in terms of lower average delay: switch or 
wait-and-see... 

Further, one can imagine a situation where the server may look into the close future of 
the incoming traffic at the present station; and it may thus decide to abandon the station 
before the end of its wait-and-see period, when it is clear that no traffic will arrive during 
that time. 

Acknowledgement. This work was supported by the DFG Research Center "Math- 
eon - Mathematics for key technologies" in Berlin. 
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re 

0) 

to 




1 . Exhaustive strategy 
■2. Lower bound 
-3. Wait-and-see strategy I 
-4. Wait-and-see strategy II 



model parameters: 



X^ = 0.3, X^ = 0.05, r^ = 5.5 



1, b2 = 1, b' 



,(2). 



1, b: 



,(2) 



„P,EC+T, 



6 8 10 

Wait-and-See credit T 



Figure 4: Comparison of the optimal credits with Strategy I and II 
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